Information geometric approach to the renormalisation group 
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We propose a general formulation of the renormalisation group as a family of quantum channels 
which connect the microscopic physical world to the observable world at some scale. By endowing the 
set of quantum states with an operationally motivated information geometry, we induce the space of 
Hamiltonians with a corresponding metric geometry. The resulting structure allows one to quantify 
information loss along RG flows in terms of the distinguishability of thermal states. In particular, 
we introduce a family of functions, expressible in terms of two-point correlation functions, which are 
non increasing along the flow. Among those, we study the speed of the flow, and its generalization 
to infinite lattices. 



The renormalisation group (RG) , one of the most pro- 
found ideas in science, allows us to understand long-range 
physics without requiring us to completely describe the 
fundamental constituents of the universe. This is because 
small-scale microscopic behaviour can be absorbed by a 
handful of parameters of some effective theory. 

The RG is typically taken to act on the space of the- 
ories or, more concretely, the space K, of Hamiltonians, 
and is thus a kind of superoperation which produces from 
a given initial Hamiltonian an effective Hamiltonian at 
some larger scale. By repeatedly composing the RG op- 
eration one generates a flow or trajectory in /C. This 
flow typically possesses attractive fixed points/surfaces 
corresponding to the physics at large lengthcales. 

The fact that vastly different theories can converge to 
the same submanifold characterized by a handful of rele- 
vant parameters — the phenomenon of "universality" — 
invites an information-theoretic explanation [TJ. Indeed, 
because the renormalisation procedure ignores progres- 
sively larger scale features of the theory, its irreversibility 
should be directly related to a loss of information about 
the microscopic physics. 

Previous work in this direction has focused on the at- 
tempt to derive a function on the manifold K, which al- 
ways decreases along the RG flow, except at a fixed point, 
which would prove the irreversibility of the renormalisa- 
tion procedure. For 1 + 1-dimensional quantum fields, 
this is achieved by Zamolodchikov's c-theorem |2|, which 
establishes the existence of such a scalar function, ex- 
pressible in terms of the correlation functions of the the- 
ory. Similar results have been obtained for higher dimen- 
sional theories 0] . More direct information-theoretic 
approaches were also explored in the context of classical 
field theory [5) [5], or using ground-state entanglement [7]. 

Here we propose a different approach based on an 
information-theoretic formulation of the renormalisation 
group. We argue that the RG is naturally expressed in 
terms of a markov process on the manifold of statistical 
quantum states. Because thermal states essentially spec- 
ify their Hamiltonian, we can pull back any information- 
theoretic structure on the manifold of statistical quan- 
tum states to the manifold of Hamiltonians. In this con- 
text, the information being lost along the RG flow is that 
which measures distinguishability between different theo- 




FIG. 1. Artist's impression of the manifold of states S and the 
manifold of Hamiltonians KL. The manifolds are related by the 
exponential map, and their tangent spaces by its derivative 
fl p . Here p — e K jZ and x = Q p (a). 



Ties. This gives rise to an information metric on K. which 
allows for the derivation of QFT-fricndly c-like quanti- 
ties. 

Suppose that the fundamental constituents of the sys- 
tem under study are quantum particles/degrees of free- 
dom interacting at some extremely short length scale e. 
We write the (possibly mixed) quantum state of this sys- 
tem as p £ . When we perform an experiment at a larger 
length scale £ 3> e we are making a very imperfect mea- 
surement: we are ignoring many unobservable degrees of 
freedom. The most general procedure to "ignore" de- 
grees of freedom consistent with the laws of quantum 
mechanics is described by a quantum channel £, i.e., a 
completely positive and trace-preserving map. We thus 
write the best description of the quantum state available 
to our experiments as pi — £i(p £ ), where £i denotes the 
quantum channel with ignores (or discards) all the de- 
grees of freedom we cannot observe. Typically in the 
context of Wilsonian renormalisation [5] and the func- 
tional RG the map £% is the partial trace channel over all 
momenta \k\ » For our purposes, we suppose that 
knowing the effective state at any intermediate scale is 
enough to obtain the effective state at a larger scale. We 
call I <— > £ e the renormalisation "group" or process. 

In order to connect this flow on states to a flow on 
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Hamiltonians, we assume that the fundamental state p e 
is in thermal equilibrium at a temperature r, and hence 
has the form p e = e~ H ^ T jZ for some Hamiltonian H e . 
Furthermore, if the temperature is defined by a reser- 
voir with which our system is in contact, it is natural 
that the effective states pg = £g(p £ ) are thermal states at 
the same temperature r. The effective Hamiltonian Hg 
at scale I can then be defined so that pi = e Hl / T /Z, or 
Hg = —r\og(Zpg). The partition function is independent 
of £, thanks to the fact that the maps Eg are trace preserv- 
ing. Since the temperature is assumed to be constant, 
we can work with the operator Kg = —Hg/r instead of 
Hg and, abusing language, call those Hamiltonians. The 
basic arena of our study is therefore the manifold JC of 
Hermitian operators K with the same Z — Tre 7 ^. 

On a finite-dimensional Hilbert space, the process Eg 
is generated by a Lindblad superoperator Cg, so that 
jgPi = Cg(pg). In order to further simplify the presen- 
tation, we assume a constant generator C. This may 
require the parameter of the flow to be different from the 
lengthscale £, albeit still a monotone function of I. In 
order to differentiate between them, we therefore denote 
that parameter t, and write ^pt = £(pt), so that the 
family of channels £ t — e tc is the semigroup generated 
byC. 

We define the effective Hamiltonian at the scale in- 
dexed by t to be K t such that p t = e Kt jZ. The evolution 
of the Hamiltonian K t = log(Zp t ) under the renormali- 
sation group is given by 

jK t = B { K t) 

with tangent field B{K) = Qr£C{e K ) = Q' 1 ^), which 
we expressed in terms of the useful linear superopera- 
tor 
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This superoperator can be understood as a noncommu- 
tative generalization of the operation of dividing by p. 
In geometric terms, it pushes forward a tangent vector 
at p = e K /Z to the corresponding tangent vector at K. 
We also need its inverse 
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On the manifold of states, since the diffeomorphism 
e tc is linear, its push-forward on tangent vectors is just 
itself. Hence, on the manifold of Hamiltonians, it maps 
the tangent vector a at K to a t = n~ t 1 e tc £l p (a), at the 
point K t . The infinitesimal form is 



dt 



at 



—B(K t + ea t )\ e=0 . =: V Pt a t , 



(2) 



Although our set-up is finite-dimensional, it would be 
natural to identify B{K) with the beta-function of quan- 
tum field theory in the following way. For a translation- 
invariant QFT one would usually assume that K has the 



form K = J dxJ2i9i®i( x )> where the family {§i(x)}°^ 
spans all local field operators at x, and gi are the cor- 
responding coupling constants. The generator B(K) 
then should have the form B(K) = J dx^2 i Pi(K) §i(x), 
where the component f3i(K) £ K is the beta- function for 
the coupling constant gi. 

Information geometry — Recall that vectors tangent to 
the Hamiltonian K are observables with zero expectation 
value with respect to the state e K jZ. For any two such 
observables a and &, we define the bilinear form 

(a,b) K :=-^ jr F(K + ta + sb) = Tr (a Sl p (b)), (3) 
at os o 

where F(K) = — log Tr e K is the free energy for the 
Hamiltonian H — ~tK. 

This defines a Ricmannian metric on the manifold of 
observables. The corresponding metric on the manifold 
of states is the Kubo-Mori metric: Given two tangent 
(traceless) vectors x and y at p, 



( x ,y)f 



d 2 



dt ds 



S(p + tx + sy\\p) =Ti(xfl- l (y)), (4) 



where S(p'\\p) = Tr [p 1 (log p' — logp)} is the relative en- 
tropy [10] . an information-theoretic quantity which mea- 
sures how distinguishable the two states p and p' are |11) . 
This gives an operational meaning to our metric. We can 
think of the norm yj (a, a) „ of a small tangent vector 
a as measuring how distinguishable the perturbed state 
e K+a /Z is from e K /Z. 

There are two compatible interpretations of the rela- 
tion between these bilinear forms, connected to the dual 
role played by observables in equilibrium statistical me- 
chanics. On one hand, observables are perturbations of 
Hamiltonians, and are hence tangent vectors to the man- 
ifold of Hamiltonians, connected to the manifold of states 
via the exponential diffeomorphism. On the other hand, 
observables can be seen as cotangent vectors on the man- 
ifold of states, since they are meant to be combined with 
states to yield an expectation value. 

The main property of these dual metrics is that the 
norm of a tangent vector cannot increase under the action 
of a quantum channel [H] . On states this means that for 
any channel M and tangent vector x, 



(Af{x),Af(x)) Ar(p) < (x,x) f 



(5) 



In classical probability theory, there exists only one met- 
ric with this property: the Fisher-Rao metric, which 
has been extensively used in statistical physics (e.g. see 
Refs. [T31 - fT5] in the context of renormalization) . The 
metric defined in Eq. Qis one of a family of noncommu- 
tative generalizations jl6j , and reduces to the Fisher-Rao 
metric for commuting operators and states. 

It follows from this contraction property that, along 
the flow generated by B on the manifold of Hamiltonians, 
for any tangent vector a, 



dt 



(a t ,a t ) Kt < 0, 



(G) 
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which, more explicitly, is equivalent to the statement that 
for all p = e K JZ, and for all tangent vectors a, 



2 (a,V p (a)) K + Tr (aSl„(a)) < 0, 



(7) 



where the f2 p denotes the derivative of fi p with respect 
to t. 

Non-increasing scalar - - Motivated by Zamolod- 
chikov's c-theorem we want to build a scalar field 
K i y f(K) on the manifold K, of Hamiltonians which 
has the property that it never increases under the renor- 
malisation flow generated by B, i.e. such that 



dt 



f{K t ) < 



(8) 



and, if possible, such that it stops decreasing only at a 
fixed point. 

In the case where an exact fixed point of the renor- 
malisation semigroup is known, i.e., a state a such that 
C(a) = 0, then any contractive measure of distinguisha- 
bility with respect to a has that property. An example 
would be the relative entropy: f{K) — S(e K /Z\\a). In- 
deed, S(p 1 \\p 2 ) satisfies S(£(p 1 )\\£(p 2 )) < S(p 1 \\p 2 ) for 
any quantum channel £. It has a straightforward inter- 
pretation: it measures the distinguishability with respect 
to the fixed point. Moreover, if a is the maximally mixed 
state, i.e. the infinite temperature fixed point, then this 
function is simply a constant minus the entropy. How- 
ever, this presupposes that the fixed point towards which 
the system flows is known. In addition, it is a very diffi- 
cult quantity to compute. We propose instead a quantity 
which can be computed locally directly in terms of cor- 
relation functions. 

In view of Eq. ((6), the norm of any observable, which 
is expressed in terms of a Green function, does decrease 
along the flow. That quantity is not purely a function 
of the Hamiltonian K. However, it can be made so if 
the observable is replaced by a tangent field K i-> A(K) 
which is invariant under the flow, i.e. such that, for all t, 
A(K t ) is equal to the image under the flow of the tangent 
vector A(Kq). Using Eq. ([2]), the infinitesimal version of 
this condition is 



dt 



A{K t ) = V Pt {A{K t )), 



(9) 



where p t = e Kt /Z. This is equivalent to the statement 
that the Lie derivative of A with respect to B is zero. An 
example is A = B itself, or fields of the form A(K) = 
fri£'(e x ), where £ is any linear superoperator which 
commutes with C. Indeed, if a field A satisfies Eq. 
then Eq. ^ implies that the scalar 



f(K) = (A(K),A(K)) 



K 



(10) 



where p = e K /Z, satisfies Eq. Q. For instance, if we 
use A := B, then f(K) is the squared speed of the flow, 
f(K) = (B(K),B(K))x measured in the distinguishabil- 
ity metric, which is zero if and only if B(K) = 0, i.e. at 



a fixed point. In the classical setting, this corresponds to 
the "velocity function" of Ref. [14] . 

Infinite lattices — We have so far worked within the 
framework of finite-dimensional Hilbert spaces, and for a 
generic semigroup £t- However, to study renormalisation 
more specifically, we may need to consider infinite lattices 
and translation-invariant states, for which quantities like 



those defined in Eq. ( 10 ) are usually infinite. The natural 



course of action is to consider a density corresponding to 
f(K). But, as we will see, this presents severe issues if 
the renormalisation flow includes an explicit rescaling of 
the lattice. 

Therefore, we first consider the case where the renor- 
malisation flow commutes with translations, which means 
that it consists only of decimation with no active rescal- 
ing of length. Explicitly, if T x is the superoperator im- 
plementing a translation of the state by the lattice vector 
x, we require that CT X — T X C. 

In addition, we require that the renormalisation flow 
stays inside a submanifold A4 of local translation in- 
variant Hamiltonians, which implies that, for any K G 
M 1 B(K) = J2 x bx(K), where the operators b x (K) — 
T x (b(K)) act only on a finite number of sites, and T x is 
a linear superoperator implementing a translation by the 
lattice vector x. 

We then define 



f(K) = J2iMK),b x (K)) K = (b(K),B(K)) 



K ■ 



(11) 



This quantity corresponds to the speed density of the 
Sow. We show that under the above assumptions, it is 
indeed non-increasing along the flow, i.e. -^f(K t ) < 0. 
First we observe that, since [T X ,W P ] — and T x (p) = p, 
then 

(VAA) = (V p T y b,b x ) = (V p b 7 T- y b x ) - (V p b,b x - y ). 

(12) 

Let B L := Y1\ X \<L ^x(K), and Vl be the number of trans- 
lations satisfying \x\ < L. Then Eq. Q implies 

> 2(W P B L , B L ) + Tr {B L tl p {B L )) 



E 

\y\<L,\x\<L 



2(V P 6, b x _ y ) + Ti (bn p (b x _ y )) 



[2(V p b,b x ) + Tr (b h p {b x )) 



where Jl{%) counts the number of identical elements in 
the double sum. It is such that 

lim TrE/^) U{V P b,b x ) +Tr (b(l p (b x )) 

x 

= 2(V P 6,6) +Tr(6f2 p (£0) 

which then must also be negative. We conclude by show- 
ing that ^- t f{K t ) is equal to that last quantity. The 
derivative -^.f(K t )\ t=0 is made of three terms, one of 
which is equal to Tr (£>J7 p (£>)). The second term is just 
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(b, ±B\ ) = (b,VB) = (Vb,B) by virtue of Eq. jl2j. The 
last term is 

d b{K t )\ Q ,B) =J2db(K t )\ ,T x b) = (±B(K t )\ Q ,b) 



= {V p {B),b) = {V p (b),B), 



which concludes the proof. 



The quantity defined in Eq. (11) may fail to be non- 



increasing if the flow does not commute with translations. 
For a counterexample, we consider the one-dimensional 
Ising model with the flow generated by the superoperator 
C(p) = J^i [Tr i(p) ~ p] i where the sum is over all lattice 
sites and Tr j traces out the ith site and then, for all j >i, 
replaces the jth sites by the states of the (j + l)th site, 
i.e. it shifts the spins on the right of the ith in order to fill 
the gap. A good property of this renormalisation group 
is that it does not increase the range of the interactions. 
Hence the nearest-neighbour Ising Hamiltonian stays 
within the manifold of commuting translation-invariant 
Hamiltonians acting only on nearest-neighbours, which 
is three-dimensional. The Hamiltonians can be written 
as K(J, h,c) — J £\ SiS i+ i + hj^i Si + cl, where Si is 
the diagonal matrix acting on site i with eigenvalues +1 
and — 1. In finite dimensions, the expectation value of 
B(K) would alway s be zero provided that the semigroup 
is trace-preserving. However, here one needs to choose 
the parameter c so that the free energy density is zero, 
i.e. c = — log Z. 

On the stable submanifold defined by c = 
-log(2coshJ) and ft, = 0, we have B{K) = J2i b i(K) 
with bi(K(J)) = -e~ J smh(J)SiSi +1 - log(2cosh(J))l 



QI]. We then obtain f(K) 



-•2,3 



tanh(J)] . One can 



check J > always decreases along the flow, but f(K) at- 
tains a maximum for a finite J, which indeed contradicts 
the proposition that f{K) be always decreasing along the 
flow. 

Example — Here we introduce an example of a Lind- 
blad generator which can be defined on any lattice and 
generates a renormalisation flow which commutes with 
translations (and hence does no active rescaling). Given 
this property, the quantity defined in Eq. (Ill is non- 
increasing. The Lindblad generator is 



(13) 



where the sum is over all neighboring pairs of spins on 
the lattice, and Uij denotes the unitary map swapping 
sites i and j. 

The interpretation of this generator is that, after a 
small "time" e, we have a probability e of confusing 
any given neighbouring pair of sites, hence losing short 
scale information. In fact, as we will see, a finite time 
t amounts to erasing information up to a length scale 
I oc \ft. Hence the corresponding generator for the flow 
in terms of the lengthscale i is = p£. 

The fixed points of this generator, i.e. states a such 
that C(a) = 0, are precisely the states invariant under 




FIG. 2. The quantity f(K) denned in Eq. versus the 
temperature in unit of J/ks for the two-dimensional Ising 
model on periodic square lattices computed using a Monte 
Carlo simulation. The vertical dashed line represents the 
exact transition temperature r c . The different curves corre- 
spond to lattice sizes 10 2 , 20 2 , 40 2 . The fitting function is of 
the form a/r 2 ' 1 (dashed curve) for r > r c and c — d(r c — r) e 
for r < T c , convoluted with a Gaussian to account for the 
finite-size effect. 



permutations of any two sites. According to the quantum 
de Finetti theorem, this implies that any reduced state on 
a finite number of sites has the form J d/i(p) p® N where 
/i is a probability measure on mixed states. Hence, due 
to the absence of active rescaling, the exact fixed points 
all correspond to mean-field theories. More interesting 
critical states must be identified through the asymptotic 
behaviour of the flow under a passive rescaling of the 
lattice spacing, i.e., in the continuum limit. 

Let's study the effect of the semigroup on a one- 
dimensional lattice. It is best interpreted in the Heisen- 
berg picture, where it maps observables of the effec- 
tive coarse-grained theory to actual physical observables 
of the microscopic theory. For instance, the action of 
the generator on an observable dj at site i is £^(a.;) = 
a i+i ~ 2a.j + <ii—\. 

We recognize a finite-difference approximation of a dif- 
fusion equation. Therefore, for large "time" t, we ex- 
pect that any local operator of the form a(/) = /idi, 
were fi > and J2 i fi = 1, will be mapped to 



" («(/)) 



JL 

/47Tt 



Jk o it afe. In particular, we see 
that the physical resolution of our observables filtered by 
e tc is proportional to \ft. 

Figure [2] shows the non-increasing quantity defined in 
Eq. ( j 1 1 [ ) for the two-dimensional Ising model on a square- 
lattice computed using a Monte Carlo simulation. We see 
that it peaks at the transition temperature, and does not 
appear to diverge. 

Outlook — We have proposed a formulation of the 
RG on Hamiltonians, where the loss of information can 
be defined and quantified precisely in terms of the loss 
of distinguishability between the corresponding thermal 
states. In particular, the semigroup generated by C de- 
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fined in Eq. ( 13 1 is a promising candidate for a definition 



of the RG leading to a proper continuum limit and asso- 
ciated RG flow. Further work is necessary to understand 
the problems that our approach encounters when dealing 
with active or passive rescaling of length. 
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